Identifying operons and untranslated regions of transcripts using Escherichia coli RNA expression analysis
Abstract
Microarrays have traditionally been used to assay transcript expression of the coding regions of genes. Here, we use Escherichia coli oligonucleotide microarrays to assay transcript expression of both open reading frames (ORFs) and intergenic regions. We then use hidden Markov models to analyse these expression data and estimate the transcription boundaries of genes. This approach allows us to identify 5' untranslated regions (5' UTRs) of transcripts as well as genes that are likely to be operon members. The operon elements we identify correspond to documented operons with 99% specificity and 63% sensitivity. Similarly, we find that our 5' UTR results accurately coincide with experimentally verified promoter regions for most genes.
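The abstract does not give the structure of the hidden Markov models, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a two-state model (0 = not transcribed, 1 = transcribed) with Gaussian emissions over a series of probe intensities ordered along the genome, and uses Viterbi decoding to recover contiguous transcribed segments whose ends serve as estimated transcription boundaries. The function names, the emission means and standard deviations, and the `stay` probability are all illustrative assumptions.

```python
import math


def gaussian_logpdf(x, mean, sd):
    """Log density of a normal distribution with the given mean and sd."""
    return -0.5 * math.log(2 * math.pi * sd * sd) - (x - mean) ** 2 / (2 * sd * sd)


def viterbi_two_state(signal, means=(0.0, 2.0), sds=(1.0, 1.0), stay=0.9):
    """Most likely state path for a two-state HMM (hypothetical parameters).

    State 0 = not transcribed, state 1 = transcribed. `stay` is the assumed
    probability of remaining in the same state between adjacent probes.
    """
    if not signal:
        return []
    log_stay = math.log(stay)
    log_switch = math.log(1.0 - stay)
    # Uniform initial distribution over the two states.
    score = [math.log(0.5) + gaussian_logpdf(signal[0], means[s], sds[s])
             for s in range(2)]
    back = []  # back[t][s] = best previous state for state s at step t + 1
    for x in signal[1:]:
        new_score, pointers = [], []
        for s in range(2):
            candidates = [score[p] + (log_stay if p == s else log_switch)
                          for p in range(2)]
            best_prev = max(range(2), key=lambda p: candidates[p])
            pointers.append(best_prev)
            new_score.append(candidates[best_prev]
                             + gaussian_logpdf(x, means[s], sds[s]))
        back.append(pointers)
        score = new_score
    # Trace back from the best final state.
    state = max(range(2), key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


def transcribed_segments(path):
    """Inclusive (start, end) index pairs for each run of state 1."""
    segments, start = [], None
    for i, s in enumerate(path):
        if s == 1 and start is None:
            start = i
        elif s == 0 and start is not None:
            segments.append((start, i - 1))
            start = None
    if start is not None:
        segments.append((start, len(path) - 1))
    return segments


# Toy intensities (invented for illustration): low, then high, then low again.
toy_signal = [0.0, 0.1, -0.1, 3.0, 3.2, 2.9, 3.1, 0.0, 0.1]
toy_path = viterbi_two_state(toy_signal, means=(0.0, 3.0))
toy_segments = transcribed_segments(toy_path)
```

Under these assumptions, a segment that begins upstream of an ORF's start would suggest a 5' UTR, and a single segment spanning several adjacent ORFs and the intergenic probes between them would suggest co-transcription in an operon.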
Related articles
Unprecedented High-Resolution View of Bacterial Operon Architecture Revealed by RNA Sequencing
We analyzed the transcriptome of Escherichia coli K-12 by strand-specific RNA sequencing at single-nucleotide resolution during steady-state (logarithmic-phase) growth and upon entry into stationary phase in glucose minimal medium. To generate high-resolution transcriptome maps, we developed an organizational schema which showed that in practice only three features are required to define operon...
Enhancing transcription through the Escherichia coli hemolysin operon, hlyCABD: RfaH and upstream JUMPStart DNA sequences function together via a postinitiation mechanism.
Escherichia coli hlyCABD operons encode the polypeptide component (HlyA) of an extracellular cytolytic toxin as well as proteins required for its acylation (HlyC) and sec-independent secretion (HlyBD). The E. coli protein RfaH is required for wild-type hemolysin expression at the level of hlyCABD transcript elongation (J. A. Leeds and R. A. Welch, J. Bacteriol. 178:1850-1857, 1996). RfaH is als...
Chlamydial gene encoding a 70-kilodalton antigen in Escherichia coli: analysis of expression signals and identification of the gene product.
In an attempt to identify chlamydial genes whose native promoters allow them to be expressed in Escherichia coli, we isolated and characterized a chlamydial gene identified by screening a library of chlamydial DNA with antichlamydial antibodies. This gene encodes a 70-kilodalton immunoreactive polypeptide in E. coli hosts. Sequence analysis of the 5' portion of the gene identified its product a...
Detection of non-coding RNA in bacteria and archaea using the DETR'PROK Galaxy pipeline.
RNA-seq experiments are now routinely used for the large scale sequencing of transcripts. In bacteria or archaea, such deep sequencing experiments typically produce 10-50 million fragments that cover most of the genome, including intergenic regions. In this context, the precise delineation of the non-coding elements is challenging. Non-coding elements include untranslated regions (UTRs) of mRNA...
Global analysis of posttranscriptional regulation by GlmY and GlmZ in enterohemorrhagic Escherichia coli O157:H7.
Enterohemorrhagic Escherichia coli (EHEC) is a significant human pathogen and is the cause of bloody diarrhea and hemolytic-uremic syndrome. The virulence repertoire of EHEC includes the genes within the locus of enterocyte effacement (LEE) that are largely organized in five operons, LEE1 to LEE5, which encode a type III secretion system, several effectors, chaperones, and regulatory proteins. ...
Journal title:
- Bioinformatics
Volume 18 Suppl 1
Publication date 2002